



(N 

o 

(N 
C 

00 



O 
O 

Oh 



> 
(N 
On 
0O 

m 

O 



This is an unedited preprint. The original publication is available at 



http:/ /www.springerlink.com 



X 



http://www.doi.Org/10.1023/A:1012606309138 



Preprint (2000) 0-4 



1 



Interpretation of Mossbauer spectra in the energy and 
time domain with neural networks 

H. Paulsen^ R. LinderElF. Wagner E H. Winkler^ S.J. PopplEA.X. Trautwein^ 

a Institut fur Physik, Med. Universitat Liibeck, D-23538 Liibeck, Germany 
E-mail: paulsen@physik.mu-luebeck.de 
b Institut fur Med. Informatik, Med. Universitat Liibeck, D-23538 Liibeck, Germany 
c Institut fur Theoretische Physik, Christian-Albrechts-Universitat, D-2^118 Kiel, Germany 

An artificial neural network for extracting reasonable and fast estimates of hyper- 
fine parameters from Mossbauer spectra in the energy or time domain is outlined. 
First promising results for determining the asymmetry of the electric field gradient 
at the nucleus of a diamagnetic iron center as derived with different types of neural 
networks are reported 
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Artificial Neural networks (ANN) are used for the analysis of experimental 
data in a broad range of scientific disciplines. In Mossbauer spectroscopy ANN 
have been used to detect corrosion products [lj or to establish databases of known 
spectral patterns [2]. In the work outlined here ANN are used to build a pro- 
gram package that allows to obtain a reasonable and fast estimate of hyperfine 
parameters, e.g. the isomer shift, the electric field gradient tensor, the magnetic 
hyperfine interaction tensor, from experimental Mossbauer transmission spectra 
(MTS) or nuclear forward scattering (NFS) spectra. The fast estimate of these 
parameters is obligatory when a fast decision is required whether and under 
which conditions (temperature, applied field) further measurements have to be 
performed, especially if time is a limiting factor like for recording NFS spectra 
at 3 rd generation machines. Additionally, such an estimate is useful in providing 
a set of start parameters for, e.g., a conventional spin-Hamiltonian analysis of 
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the experimental spectra |3|4| . In a first step, that is reported here, only the 
determination of the asymmetry parameter r\ of the electric field gradient at the 
iron nucleus of a diamagnetic and randomly oriented complex is considered. 

The development of this program package includes an ANN with supervised 
learning and consists of three parts: (i) choosing appropriate network architec- 
tures and learning rules, (ii) training the ANN, and (iii) testing the performance 
of the ANN with a set of spectra that have not been used for training. In the 
present study we have used feed-forward networks [5] with one or two hidden 



layerq^Jof neurons and two different learning rules, i.e. an improved backpropa- 

gation algorithm (APROP) [6j and, included in the NETFIT program package [7J, 
a variable metric minimization algorithm (MINIM) [8]. The ANN was realized as 
C++ (APROP) or Fortran (NETFIT) programs, respectively. Each neuron is re- 
alized by a set of variables containing (i) the addresses of the connected neurons, 
(ii) for each connection a weight, that is determined during the training process, 
and (iii) the current value of the neuron representing a data point of the mea- 
sured spectrum (input layer), the retrieved parameter (output layer), or data of 
no obvious physical meaning (hidden layers) . To train the ANN 5000 Mossbauer 
transmission spectra have been simulated using the spin-Hamiltonian formalism 
|3j together with powder averaging. While the linewidth T = 0.37 mm/s was kept 
fixed for all spectra, the quadrupole splitting AEq, the asymmetry parameter r], 
and the external field B were chosen at random from the intervals [1. . .4 mm/s], 
[0. . .1], and [4. . .8 T], respectively. Noise corresponding to an off-resonance count 
rate of 10 6 was introduced using random numbers with Poisson distribution. As 
input for the ANN the normalized transmission count rates in 256 channels cov- 
ering the velocity range from -8 to +8 mm/s were used. In the same way NFS 
spectra have been simulated using the SYNFOS program [4J with a simulated 
noise level that corresponds to a maximum count rate of 1000. 

During training each ANN output, ?? nc t, was compared with the actual value 
rj of its corresponding input spectra. The sum x 2 (T) of the squared differences, 



1 One hidden layer is necessary and sufficient to overcome the limitations of a perceptron, 
however two hidden layers might be more efficient in some cases. 




X 2 (T) = J ! r S2\ri(s)-Vnet(s)\ 2 



(1) 
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Table 1 



Performance of different ANNs for Mossbauer spectra in energy (MTS) and time domain (NFS). 



Spectra 


Preprocessing 


Hidden 


learning 


Training 


Generalization 






neurons 


rules 


x 2 


P0.05 


x 2 


P0.05 


MTS 


no 


3 


MINIM 


0.07 


0.55 


0.09 


0.44 


MTS 


no 


2 x 20 


APROP 


0.05 


0.81 


0.08 


0.57 


MTS 


yes (Lorentzian fit) 


5 


MINIM 


0.05 


0.75 


0.06 


0.71 


MTS 


yes (Lorentzian fit) 


2 x 20 


APROP 


0.04 


0.82 


0.06 


0.78 


NFS 


no 


4 


MINIM 


0.06 


0.63 


0.09 


0.55 


NFS 


no 


2 x 20 


APROP 


0.08 


0.76 


0.06 


0.65 



where the index s enumerates the 5000 different spectra of the training set T, is 
a measure for the memorization ability of the ANN. x 2 {T) was minimized itera- 
tively by changing as internal ANN parameters the weights w%j between neurons i 
and j of adjacent layers. In case of APROP supervision of the training procedure 
demands a second, independent set of 1000 spectra, which was generated in the 
same way as the training set. After training was completed the ANN was kept 
fixed and its performance was tested by an additional independent set G contain- 
ing 1000 spectra. The performance, or more precisely, the generalization ability 
of the trained ANN is represented by X 2 {G) or, alternatively, by the probability 
to obtain an ANN output 77 nct with less than 0.05 absolute deviation from the 
actual value 77, 

PO.O5 = ^E (°- O5 -| ? ?( S )-^ct( 5 )|) • (2) 

Both ANN that have been used here independently, NETFIT and APROP, 
predict 77 from MTS with an accuracy that should be sufficient in most cases 
(Tab. 1). The same accuracy can be reached for the NFS spectra. In view of the 
expansion of the program for simultaneously obtaining several hyperfine parame- 
ters (and not only one as in the present case) it is desirable to keep the ANN and 
the computational effort during the training as small as possible. For this reason 
preprocessing of the Mossbauer spectra was introduced, i.e. before starting the 
ANN procedure the spectra were fitted by six Lorentzians with fixed linewidth. 
The position and the height of these Lorentzians together with AEq and the 
external field B are then presented to the ANN. As a result the performance, 
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Figure 1. Generalization ability: ANN output ?7 nc t vs the corresponding actual n for 1000 MTS 
spectra in set G. In (a) raw data and in (b) Lorentzian fits were used as ANN input. 

as measured by \ 2 an d Po.05> improves (Tab. 1, Fig. 1). At the same time the 
number of neurons in the input layer is reduced from 256 to 14. In summary we 
state that a relatively small ANN with 14 input neurons, 5 hidden neurons, and 
1 output neuron provides a reasonable estimate for rj. This situation is promis- 
ing in view of expanding the ANN procedure towards a fast and simultaneous 
multi-parameter analysis of Mossbauer spectra. 
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